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Abstract 

Seven microsatellite loci were used to develop multilocus genotypes for 375 individuals 
of Neotoma albigula collected from 32 localities throughout Arizona. Twelve of 32 localities in 
this study contained arenavirus antibody-positive individuals. Several statistical analyses were 
used to determine genetic structure, levels of genetic variability, and degree of relatedness in 
order to assess the effects of the regional gene pool on the presence or absence of arenaviruses. 
Degree of relatedness was used as a proxy for familial susceptibility within a gene pool. The 
F st value (0.110) indicated moderate genetic differentiation among localities. All localities 
displayed low to moderate levels of genetic diversity in terms of mean observed heterozygosity 
(0.357-0.787) and mean polymorphic information content (0.256-0.775). Mean relatedness 
values were slightly negative for all localities, signifying that individuals within localities were 
less related than individuals taken from a locality at random. Comparison of genetic diversity 
and relatedness values between antibody-positive and antibody-negative localities indicated no 
differences among the sites. This suggests that the presence of arenaviruses in certain localities 
is not associated with variation in genetic diversity or relatedness as detected by these markers. 

Key words: genetic variation, microsatellites, Neotoma albigula , population genetic 
structure, probability of identity, relatedness 


Introduction 


Neotoma albigula (White-throated Woodrat) is a 
wide ranging species (Hall 1981; Macedo and Mares 
1988) distributed in southern California, Baja Califor¬ 
nia, southern portions of Utah and Colorado, Arizona, 
western New Mexico, and northern Mexico (Edwards 
et al. 2001). Edwards et al. (2001) used DNA sequence 


data to split N. albigula into two distinct species, N. 
albigula and N. leucodon (White-toothed Woodrat). 
Neotoma albigula typically is found in arid areas in a 
wide variety of habitats including juniper-pinyon wood¬ 
lands, rocky outcrops, and in association with various 
cactus species ( Opuntia ; Macedo and Mares 1988; 
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citations therein). In Arizona, N. albigula often are 
found in piny on-juniper woodlands, and in association 
with cholla and prickly pear cactus (Hoffmeister 1986). 

This species is naturally associated with White- 
water Arroyo Virus (WWAV) and other arenaviral spe¬ 
cies (Fulhorstetal. 1996; Kosoyetal. 1996; Calisheret 
al. 2001; Abbott et al. 2004), hepatitis E virus (Favorov 
et al. 2000), Leishmania mexicana (the protozoan that 
causes cutaneous leishmaniasis; Kerr et al. 1999), and 
hantaviruses (Mantooth et al. 2001), among others. 
Fulhorst et al. (1996) isolated the WWAV prototype 
strain AV9310135 from N. albigula from Whitewater 
Arroyo in McKinley County, New Mexico. Subse¬ 
quently, strains of WWAV or WWA-like viruses have 
been isolated from TV. macrotis (Cajimat et al. 2007b; 
Milazzo et al. 2015), N. albigula (Abbott et al. 2004; 
Milazzo et al. 2008), N. mexicana (Cajimat et al. 2008, 
2011; Inizan et al. 2010), and N. micropus (Fulhorst et 
al. 2002; Cajimat et al. 2007a, 2011, 2013; Milazzo et 
al. 2010, 2013). 

Abbott et al. (2004) examined 2,434 rodent 
samples collected from localities throughout Arizona, 
including 1,250 N. albigula samples. Nine percent 
(112/1,250) of these samples were antibody-positive 
against WWAV in an indirect fluorescent antibody test; 
including up to 24 individuals from a single locality. 
Additionally, samples of N. albigula from 12 of 32 
collection sites were positive for arenavirus antibodies. 
This study focused on a subset of375 samples from the 
Abbott et al. (2004) study collected from 32 localities. 
Animals used in this study were found in juniper- 
pinyon woodlands, montane conifer forests, Sonoran 
Desert scrub—Arizona upland, Mohave Desert scrub, 
semi-desert scrub grassland, juniper-pinyon chaparral 
woodland, and Sonoran Desert scrub—lower Colorado 
habitats (Abbott et al. 2004). Several individuals also 
were collected in a citrus orchard. Abbott et al. (2004) 
reported no statistical association between habitat type 
and arenavirus prevalence in a given locality. 


Because some sites contained antibody-positive 
individuals, whereas others did not, this study presents 
the opportunity to examine genetic diversity and relat¬ 
edness in antibody-positive versus antibody-negative 
localities. We compared population genetic parameters 
among sampling localities that were identified by 
Abbott et al. (2004) as containing antibody-positive 
individuals to those localities that did not contain 
antibody-positive individuals to test for effects of the 
regional gene pool on presence or absence of the virus. 
The specific objectives of this study were to: 1) exam¬ 
ine genetic substructure; 2) examine levels of genetic 
diversity within and among localities; 3) compare lev¬ 
els of genetic diversity between localities containing 
antibody-positive individuals to those not containing 
antibody-positive individuals to determine if the gene 
pools between the two groups differed; 4) determine 
degree of genetic relatedness within and among locali¬ 
ties; and 5) compare the levels of genetic relatedness 
between localities that did not contain antibody-positive 
individuals to those that did contain antibody-positive 
individuals to determine if the degree of relatedness 
(as a proxy for familial susceptibility) differed between 
the two types of sites. To achieve these objectives, 
multilocus microsatellite genotypes were developed 
for individuals collected from localities throughout 
Arizona. Microsatellites have been used to study host 
population genetics in species that carry diseases such 
as malaria (Lehmann et al. 1996; Walton et al. 1998; 
Donnelly et al. 1999,2001; Pinto et al. 2002; Braginets 
et al. 2003; Chen et al. 2004; Tripet et al. 2005), dengue 
fever (Ravel et al. 2001; Huber et al. 2002; Paupy et 
al. 2004), and arenaviruses (Mendez-Harclerode et al. 
2005, 2007, 2016). Several statistical analyses were 
used to determine genetic structure, levels of genetic 
variability, and degree of relatedness in order to assess 
the effects of the regional gene pool on the presence or 
absence of arenaviruses. 


Materials and Methods 


Collecting localities andDNA extraction. —Three 
hundred seventy-five individuals collected from 32 lo¬ 
calities in 10 counties throughout Arizona (Table 1, Fig. 


1, Appendix) were used in this study. Voucher speci¬ 
mens and tissues for all samples were archived in the 
Natural Science Research Laboratory at the Museum 
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Table 1. Locality data for the 364 individuals, collected from 32 localities in Arizona, used in this study. For each 
locality, site number (Site, corresponding to Fig. 1), specific locality name (Name), latitude/longitude (Lat/Long), 
number of individuals (N), and presence (P) or absence (A) of arenavirus antibody-positive individuals (AABP) 
are provided. The numbers in parentheses in this column represent the number of positive individuals used in 
this study. NA indicates that no latitude/longitude data were gathered for that locality. Antibody status for each 
individual used in the study is provided in the Appendix. 


Site 

Name 

Lat/Long 

N 

AABP 

1 

AZ: Apache Co.; Three Turkey 

36° 1'44"N/109°24'46" W 

1 

A 

2 

AZ: Apache Co.; CDC 

NA 

1 

P(l) 

3 

AZ: Apache Co.; Saint Johns 

34°28'28"N/109 o 19T8"W 

6 

A 

4 

AZ: Navajo Co.; MVP Pig Farm 

34°33'28"N/110°4'37"W 

12 

P(l) 

5 

AZ: Navajo Co.; Lone Pine Reservoir 

34°20'42"N/110°4'53"W 

11 

A 

6 

AZ: Navajo Co.; Trick Tank Draw 

34°33'43"N/110°46T3"W 

10 

A 

7 

AZ: Coconino Co.; Snake Gulch 

36 o 40'18"N/l 12°22'3"W 

2 

A 

8 

AZ: Mohave Co.; Oatman 

35°1’56"N/114 0 16'51"W 

15 

A 

9 

AZ: Mohave Co.; Love Camp/Lake Alamo 

34°18 , 26"N/113°33 , 27"W 

20 

A 

10 

AZ: Yavapai Co.; Pine Flat 

35°1T2"N/112°49'59"W 

18 

P(0) 

11 

AZ: Yavapai Co.; Hillside 

34°20'40"N/112°36'59"W 

20 

A 

12 

AZ: Yavapai Co.; Wagner 

34°25'56 ,, N/112°54'57"W 

10 

P (6) 

13 

AZ: Yavapai Co.; Hassayampa 

34°20'21 ,, N/112°34 , 59"W 

10 

P (7) 

14 

AZ: Yavapai Co.; Granite Dells Ranch 

34°36'55"N/112°23'44"W 

19 

A 

15 

AZ: Yavapai Co.; Sycamore Station 

34°23'28"N/112°3T"W 

10 

P (3) 

16 

AZ: Yavapai Co.; Horseshoe Ranch 

34°15'59" N/112°3'46"W 

10 

A 

17 

AZ: Yavapai Co.; Sayer Spring 

34°1'0” N/112°39'4"W 

20 

A 

18 

AZ: Gila Co.; Barnhardt Trailhead 

34°6'8" N/111°22T6"W 

10 

P (3) 

19 

AZ: Gila Co.; Windmill Tank 

33057-2311 n/111°17 , 2"W 

10 

P (4) 

20 

AZ: Gila Co.; White Cow Mine 

33053-4911 N/111°16 , 57 ,, W 

10 

P (3) 

21 

AZ: Gila Co.; Cherry Creek 

33045149 " N/110 o 48'47"W 

7 

P (6) 

22 

AZ: Gila Co.; Gleason Flat 

33 0 46'24 11 N/110°40'30"W 

7 

A 

23 

AZ: Gila Co.; Coon Creek 

33°40'59” N/110°5r29"W 

7 

A 

24 

AZ: Gila Co.; Sierra Anchas Mountains 

NA 

2 

A 

25 

AZ: Graham Co.; Warm Springs 

33°27'6" N/110°13'33 , 'W 

5 

A 

26 

AZ: Graham Co.; Hackberry Creek 

33°23'22" N/110°2r30 ,, W 

36 

P (24) 

27 

AZ: Graham Co.; Brushy Tank 

33°22'23" N/110°18'55"W 

20 

P (12) 

28 

AZ: Greenlee Co.; San Francisco River 

33°7'30" N/109°16'47"W 

6 

A 

29 

AZ: Greenlee Co.; McDowell Road 

33°0'2r N/109°14T4"W 

10 

A 

30 

AZ: Greenlee Co.; Black Hills 

32°5'29" N/109°20 , 20"W 

19 

A 

31 

AZ: Cochise Co.; Chiracahua Mountains 

NA 

1 

A 

32 

AZ: Yuma Co.; Welton Citrus 

32°38'7" N/144°10'4"W 

19 

A 
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Figure 1. Map of Arizona showing localities where specimens of Neotoma albigula were 
collected. Site numbers correspond to data in Table 1. Black circles represent localities where 
antibody-positive individuals were collected and open circles represent localities where no 
antibody-positive individuals were collected. 


of Texas Tech University. Because we were interested 
in gene pool differences between localities and were 
not directly testing the link between specific alleles and 
viral infection, we randomly selected individuals from 
localities without knowledge of their infection status. 
Where possible, at least 20 individuals were sampled 
from each locality. Genomic DNA was extracted from 
approximately 25 mg of liver using a DNeasy Blood 
and Tissue extraction kit (Qiagen). 


Microsatellite analysis .—Twelve microsatellite 
loci (Table 2) were amplified via the polymerase chain 
reaction (PCR) using primers developed by Castleberry 
et al. (2000). PCR amplifications were conducted in 
25 pi volumes containing 1-1.5 pi genomic DNA, 0.6 
pi 10 pmol each primer, 2.5 pi 10 X PCR buffer, 1.5-2 
pi 25 mM MgCl 2 , 0.75 pi 10 mM dNTPs, and 0.25 pi 
5U/pl Taq. The thermal profile was modified from 
Castleberry et al. (2000) and consisted of a denaturation 
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Table 2. Microsatellite loci examined (Castleberry et al. 2000). Product length (PL), number of alleles (A), and 
sample size (N) for each locus for A. magister (Castleberry et al. 2000) and A. albigula (this study) are shown. 
Loci Nma02, Nma03, Nma08, Nmal 0, and Nmal2 were removed from this study due to amplification difficulties. 


Locus 


N. magister 



N. albigula 


PL 

A 

N 

PL 

A 

N 

NmaOl 

314-322 

6 

28 

304-348 

19 

367 

Nma02 

197-205 

4 

33 

NA 

NA 

NA 

Nma03 

180 

1 

12 

NA 

NA 

NA 

Nma04 

145-163 

7 

33 

130-185 

27 

365 

Nma05 

227-232 

4 

38 

208-223 

9 

367 

Nma06 

215-223 

5 

39 

198-275 

18 

367 

Nma08 

125-125 

7 

38 

NA 

NA 

NA 

NmalO 

186-224 

14 

39 

NA 

NA 

NA 

Nmall 

150-160 

8 

8 

142-214 

35 

365 

Nmal 2 

115-127 

3 

3 

NA 

NA 

NA 

Nmal4 

144-160 

7 

7 

134-176 

14 

370 

Nmal5 

120-136 

10 

10 

105-149 

19 

369 


and enzyme activation cycle at 94°C (2 min); 35 cycles 
of 94°C (30 s) denaturation, 55-57°C (30 s) annealing, 
72°C (1 min) elongation; followed by a final incubation 
at 72°C (10 min). 

Variation at individual microsatellite loci was 
examined using an Applied Biosystems 3100-Avant 
Genetic Analyzer. Reactions included 13.5-14 pi Hi-Di 
Formamide (Applied Biosystems), 0.5 pi 400HD ROX 
size standard (Applied Biosystems), and 0.5-1 pi PCR 
product. Genotypes were scored using GeneMapper 
version 3.0 software (Applied Biosystems). Alleles 
that did not amplify above a predetermined peak height 
(signal strength), were difficult to score, or appeared 
aberrant were reamplified and rescored. 

Statistical analyses .—The program Cervus 2.0 
(Marshall et al. 1998) was used to compare alleles to bin 
files generated from GeneMapper software allowing for 
determination of typing errors that may have occurred 
during data entry. Micro-Checker version 2.2.1 (Van 
Oosterhout et al. 2004) was used to test for presence of 
null alleles, large allele drop out, and error due to stutter. 


A random sample of at least 31 individuals per locus 
was genotyped twice without knowledge of previous 
scores. Using these samples, an error rate was calcu¬ 
lated by dividing the number of erroneous allele scores 
at each locus by the total number of allele scores for all 
individuals for which at least two genotypes existed. 

Structure version 2.0 software (Pritchard et al. 
2000) was used for assignment tests. Due to large 
samples sizes, individuals were assigned first to 
counties, then to localities within counties. Tests of 
group assignment were based on geographic locality 
and geographic distance from other collection sites. 
Under this approach, each locality was considered to 
represent a separate “population.” The parameters for 
all assignment tests were: bum-in length = 90,000, 
MCMC repetitions after the burn-in = 900,000, ancestry 
model = prior population information, allele frequency 
model = allele frequencies correlated, and G = 2. G 
calculates the probability of each individual having an 
ancestor that immigrated from another population. An 
individual was considered to be assigned correctly if 
it had at least an 80% probability of being included in 
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the cluster to which it originally was grouped based on 
geographic locality. 

The program Cervus 2.0 (Marshall et al. 1998) 
was used to estimate allele frequencies, observed and 
expected heterozygosity, null allele frequency, and 
polymorphic information content (PIC-index of vari¬ 
ability associated with expected heterozygosity). Prob¬ 
ability of identity (PI) was estimated with IDENTITY 
1.0 software (Wagner and Sefc 1999) using equations 
reported by Paetkau et al. (1995). This program also 
was used to identify identical genotypes among samples 
and indicate potential parent-offspring combina¬ 
tions. Pairwise and mean relatedness values for each 
population were estimated with the program Identix 
1.1 (Belkhir et al. 2002) using equations developed 
by Queller and Goodnight (1989). Mean relatedness 
values were estimated by performing 500 permutations 
on genotypic data and 95% confidence intervals were 
calculated after 100 bootstraps across loci. 

The program Fstat 2.9.3 (Goudet 2001) was used 
to estimate deviations from Hardy-Weinberg equilib¬ 
rium (HWE), linkage disequilibrium, F-statistics (Weir 
and Cockerham 1984), and R ST (Slatkin 1995; Rousset 
1996; Goodman 1997). Sequential Bonferroni correc¬ 
tions (Holm 1979; Rice 1989) were performed on all 
analyses as a function of this program. The indicative 


Five loci were removed due to amplification 
difficulties (Table 2). The remaining seven loci were 
used for all further analyses. Five individuals were 
removed from the study due to failure to amplify for 
at least five loci. All other individuals (n = 370) were 
included in population assignment analyses. Twelve 
data entry errors were detected (12/2,625 entries = 
0.457% error rate) and corrected prior to data analysis. 
Genotype scoring errors were detected at loci Nma05 
(4/276 allele calls = 1.449%), Nmal 1 (2/204 allele calls 
= 0.980%), and Nmal4 (2/138 allele calls = 1.449%). 
In all instances, two different heterozygotes calls were 
made for each sample. No evidence for scoring error 
due to stutter or large allele drop was detected at any 
locus using Micro-Checker software. The potential 
presence of null alleles was found at all loci. 


adjusted nominal level was set at 5%, following tradi¬ 
tional tests for significance at the 95% level. For all 
tests, 1,000 permutations were performed. Arlequin 
2.000 software (Schneider et al. 2000) was used to 
perform an analysis of molecular variance (AMOVA), 
which allocates percentage of genetic variation at dif¬ 
ferent hierarchical levels, using 10,000 permutations. 

Comparisons of genetic diversity and related¬ 
ness between antibody-positive and antibody-negative 
localities were performed using either the comparison- 
among-groups function of Fstat or t-tests. The program 
Fstat was used to compare observed heterozygosity, F IS , 
F st , relatedness (Hamilton 1971; Pamilo 1984, 1985), 
and gene diversity (Nei 1987) among the two groups. 
For each test, 1,000 permutations were performed. T- 
tests were used to compare expected heterozygosity, 
PIC, and number of alleles between the two groups. 
These tests were performed twice. The first group of 
tests compared all antibody-positive versus antibody¬ 
negative localities. The second group of tests compared 
all antibody-positive localities versus only those locali¬ 
ties Abbott et al. (2004) determined to be statistically 
antibody-negative. This includes sites 5,6, 8, 9,11,14, 
16, 17, 22, 23, and 28-32 (Table 1, Fig. 1). Sites 3, 7, 
24, and 25 were not included in this group as sample 
size was determined as a possible limiting factor in the 
inability to detect the presence of arenavirus antibodies. 


All but six individuals were assigned correctly to 
their respective locality with a probability of 0.800 or 
higher. Of these six individuals, two were assigned to 
localities > 320 kilometers from their original collec¬ 
tion site and four individuals were assigned to multiple 
localities with equal probability. These six individuals 
were removed and the remaining 364 samples were 
used for all further analyses. Allele calls for the 364 
individuals used in the study are provided in the Ap¬ 
pendix. Sites 1,2, and 31 contained a single individual 
and were not used in any population level analyses. 
These three individuals were included in the combined 
assessment of number of alleles, observed and expected 
heterozygosity, and PIC across all samples reported 
in Table 3. 
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Table 3. Summary statistics for each locality and all localities combined. Allele frequencies can be obtained from the 
senior author upon request. Site numbers correspond to localities in Fig. 1 and Table 1. Three localities (sites 1, 2, 
and 31) contained a single individual and were not included in any population level analyses, but were included when 
looking at these statistics across all samples (Total). Number of individuals (N), mean number of alleles (A), mean 
observed (H Q ) and expected (H E ) heterozygosity, HWE p-values over all loci (P), mean polymorphism information 
content (PIC), F IS , and mean relatedness (R) values are shown below. NA indicates that no test was performed for that 
statistic. Indicative adjusted nominal level (5%) for HWE was p < 0.001 after Bonferroni corrections. 


Site 

N 

A 

H 0 

H e 

P 

PIC 

F,s 

R 

3 

6 

5.290 

0.714 

0.720 

0.518 

0.627 

0.008 

-0.203 

4 

12 

6.710 

0.723 

0.776 

0.088 

0.713 

0.071 

-0.085 

5 

11 

4.710 

0.701 

0.710 

0.480 

0.625 

0.013 

-0.100 

6 

10 

7.140 

0.771 

0.738 

0.846 

0.669 

-0.049 

-0.110 

7 

2 

1.570 

0.357 

0.405 

0.408 

0.256 

0.167 

- 1.000 

8 

15 

6.140 

0.619 

0.651 

0.187 

0.602 

0.050 

-0.070 

9 

20 

8.000 

0.607 

0.667 

0.025 

0.625 

0.092 

-0.050 

10 

18 

6.860 

0.611 

0.673 

0.019 

0.633 

0.096 

-0.060 

11 

20 

7.140 

0.729 

0.698 

0.879 

0.648 

-0.045 

-0.053 

12 

10 

6.290 

0.671 

0.707 

0.247 

0.639 

0.053 

-0.112 

13 

10 

6.860 

0.686 

0.723 

0.213 

0.659 

0.055 

-0.113 

14 

19 

7.000 

0.616 

0.658 

0.090 

0.620 

0.064 

-0.059 

15 

10 

6.290 

0.557 

0.644 

0.015 

0.587 

0.141 

-0.107 

16 

10 

5.290 

0.600 

0.609 

0.457 

0.552 

0.016 

-0.114 

17 

20 

8.570 

0.664 

0.682 

0.289 

0.644 

0.026 

-0.056 

18 

10 

6.140 

0.714 

0.653 

0.962 

0.587 

-0.099 

-0.106 

19 

10 

5.710 

0.698 

0.673 

0.761 

0.608 

-0.040 

- 0.111 

20 

10 

5.570 

0.643 

0.643 

0.559 

0.574 

0.000 

-0.105 

21 

7 

6.000 

0.653 

0.710 

0.141 

0.626 

0.086 

-0.164 

22 

7 

6.860 

0.673 

0.830 

0.001 

0.741 

0.202 

-0.157 

23 

7 

6.570 

0.775 

0.776 

0.662 

0.684 

-0.013 

-0.151 

24 

2 

2.860 

0.786 

0.833 

0.394 

0.515 

0.100 

- 1.000 

25 

5 

5.570 

0.686 

0.794 

0.052 

0.672 

0.150 

-0.250 

26 

36 

9.430 

0.770 

0.802 

0.087 

0.766 

0.040 

-0.028 

27 

20 

10.140 

0.779 

0.820 

0.098 

0.775 

0.052 

-0.051 

28 

6 

5.430 

0.724 

0.762 

0.235 

0.657 

0.053 

-0.222 

29 

10 

7.710 

0.757 

0.798 

0.198 

0.726 

0.055 

-0.109 

30 

19 

9.430 

0.782 

0.813 

0.175 

0.767 

0.040 

-0.053 

32 

19 

7.570 

0.787 

0.769 

0.722 

0.720 

-0.023 

-0.061 

Total 

364 

19.430 

0.697 

0.811 

NA 

0.797 

NA 

NA 
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Mean number of alleles ranged from 1.570 (site 
7) to 10.140 (site 27) within the localities (Table 3). 
Mean observed and expected heterozygosities, and PIC 
are reported in Table 3. Two individuals with identi¬ 
cal genotypes were detected. These genotypes were 
confirmed, as was the identity of the original samples, 
and both samples were left in all analyses. Eight poten¬ 
tial parent-offspring groupings were detected. PI was 
1.340e -9 (1 chance in 746 million of randomly selecting 
two individuals with the same genotype). Pairwise 
relatedness values ranged from slightly negative to 
highly positive within sites. Mean relatedness values 
were negative for all sites (Table 3). 

The significance level for tests of HWE within 
sites was set at p < 0.001 after Bonferroni corrections. 
Across all loci, sites were in HWE (Table 3). The 
adjusted Bonferroni p-value for disequilibrium was 
0.002. No genotypic disequilibrium was detected (P 
> 0.007 for all pairwise comparisons). F IS ranged from 
-0.099 (site 18) to 0.202 (site 22) within sites (Table 
3). F-statistic values among sites, with 95% confi¬ 
dence intervals in parentheses, were as follows: F IT = 
0.145 (0.084, 0.223), F ST = 0.110 (0.074, 0.160), and 
F IS = 0.040 (-0.009, 0.102). Pairwise differentiation 
comparisons are shown in Table 4. Three estimators 
of R st were calculated among sites: weighted = 0.133, 
Goodman = 0.141, and unweighted = 0.140. Results 


Genetic structure .—Comparison of the F ST 
value (0.110) to guidelines provided by Wright (1978) 
indicated moderate genetic differentiation between 
sites. Pairwise differentiation values indicated 231 
significant comparisons (Table 4); although there was 
no discemable pattern among the differentiation values. 
For example, site 3, located in Apache County, was not 
significantly different from any other site with the ex¬ 
ceptions of sites 26 and 27, located in Graham County, 
and site 30, located in Greenlee County. Conversely, 
site 4 had significant pairwise differentiation values 
when compared to all sites except 22, 23, 25, and 28. 
Interestingly, site 4 contained individuals that were 
antibody-positive, whereas all individuals from sites 
22, 23, 25, and 28 were antibody-negative. As might 
be expected, site 32, which is geographically isolated 
from all other populations, was significantly different 


of the AMO VA indicated that 10.630% of the variation 
was among sites, 3.050% of the variation was among 
individuals within sites, and 86.320% of the variation 
was within individuals. 

Comparisons between antibody-positive and all 
antibody-negative localities resulted in no significant 
differences between the parameters. The results from 
Fstat analyses compared at the 5% nominal level were 
as follows: P (H 0 ) = 0.905, P (F IS ) = 0.536, P (F ST ) = 
0.856, P (Relc, corrected relatedness) = 0.536, and P 
(H s , gene diversity) = 0.418. The results of the t-tests 
compared at the 5% nominal level were as follows: t 
= -0.194, df = 27, P > 0.500 when comparing H E ; t = 
0.618, df = 27, P > 0.500 when comparing PIC; and t 
= 0.998, df = 25, P > 0.300 when comparing number of 
alleles. Comparisons between the two groups after sites 
3,7,24, and 25 were removed resulted in no significant 
differences between the groups. The results from Fstat 
analyses compared at the 5% nominal level were as fol¬ 
lows: P (H 0 ) = 0.969, P (F IS ) = 0.706, P (F ST ) = 0.976, 
P (Relc, corrected relatedness) = 0.707, and P (H s , gene 
diversity) = 0.855. The results of the t-tests compared 
at the 5% nominal level were as follows: t = -0.556, 
df = 39, P > 0.500 when comparing H E ; t = -0.407, df 
= 20, P > 0.500 when comparing PIC; and t = -0.106, 
df = 20, P > 0.500 when comparing number of alleles. 


from all other sites with the exception of site 3. Overall, 
those sites located on the geographic perimeter of the 
sampling area (i.e., site 32) tended to be genetically 
distinct from other populations, whereas those sites 
that were clustered together (i.e., sites 18-24) tended to 
be genetically similar to one another, but distinct from 
locations outside the cluster. In addition to geographic 
locality, sample size as it relates to potential genetic 
variation might also have played a role. For example, 
site 3 only contained six individuals. If the six indi¬ 
viduals selected had common genotypes, the genetic 
diversity within the population would be lower and they 
would not differ genetically from other populations. 

Genetic variation .—All sites possessed low to 
moderate levels of genetic variability based on values 
for observed heterozygosity and PIC (Table 3). The PI 


Table 4. Pairwise differentiation comparisons for localities of Neotoma aJbigula. Those sites that contained only one or two individuals (sites 1, 2, 7, 24, 
31) were not included in the analysis. Pairwise significance values after Bonferroni corrections (a < 0.001) were < 0.001 for all significant comparisons. 
(*) indicates significance at the 5% level and NS indicates non-significant comparisons. Site numbers correspond to localities in Table 1 and Fig. 1. 
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(1 chance in 746 million of randomly selecting two indi¬ 
viduals with the same genotype) also was low compared 
to other species of Neotoma (Haynie et al. 2007,2009). 
Small samples sizes for most of the populations were 
reflected in the low levels of variation. Adding to the 
low levels of variability was the dominance of a single 
or a few alleles at several loci, especially locus Nmal4. 
Most allele calls at this locus (73.78%) represented a 
single allele, thus most individuals were fixed for a 
single allele at this locus thereby decreasing genetic 
variation. Results of the AMOVA indicated that most 
of the genetic variation was within sites. However, the 
amount of variation among sites (~ 11 %) supported the 
findings that there is some genetic structure and several 
sites were genetically different from one another. 

Relatedness .—Pairwise relatedness values 
ranged from slightly negative to highly positive within 
sites, with mean relatedness values being negative for 
all sites (Table 3). Negative relatedness values indicate 
pairs of individuals in these sites are less related to one 
another than are pairs of individuals taken from a popu¬ 
lation at random. Despite negative mean relatedness 
values, some individuals within sites did show some 
degree of relatedness. Relatedness values ranged from 
as low as 0.002 (site 6; indicative of a distant cousin 
relationship) to as high as 0.756 (site 26; indicative 
of parent-offspring or sibling relationship). Eight po¬ 
tential parent-offspring groupings were detected, one 
each within sites 5, 6, 9, and 19 and two each within 
sites 10 and 26. 

Mode of transmission in arenaviruses still is in 
question, although some clues have arisen. Fulhorst et 
al. (2001), in a laboratory experiment, determined that 
viral transmission could occur both vertically (parent 
to offspring) and horizontally (between contemporary 
individuals). Calisher et al. (2001), in a study of wild 
woodrats, determined that transmission between ro¬ 
dents was through direct contact. Abbott et al. (2004) 
determined that there was no association between being 
antibody-positive and aggressive behavior between 
individuals, based on skin wounds, for the individuals 
used in this study. They also determined that there was 
no relationship between age or sex classes and being 
antibody-positive. Abbott et al. (2004) concluded that 
vertical transmission is an important process in virus 
transmission in natural populations. If vertical transi- 
mission is important, it can be predicted that localities 


with a high degree of arenavirus prevalence will have 
a high degree of relatedness, suggesting familial sus¬ 
ceptibility. Preliminary assessment of the correlation 
between relatedness and antibody status does not indi¬ 
cate a link between the two (data not shown), although 
that does not mean that a link is not present, simply that 
it was not detected using these markers. 

The lack of closely related individuals within 
these sites is not surprising. Similar patterns of related¬ 
ness values have been found in N. macrotis (Matocq 
and Lacey 2004; Haynie et al. 2007), N. fuscipes 
(Haynie et al. 2007), and N stephensi (Haynie et al. 
2009). In addition, Matocq and Lacey (2004) found 
that females, typically thought to be closely related to 
neighboring females, actually were not closely related 
nor philopatric. This pattern has not been studied in 
N. albigula , but it may explain low relatedness values. 
Additionally, the sampling strategy of this study was 
not aimed at collecting all neighboring individuals and 
therefore may have affected relatedness values. 

Arenavirus association. —Of the 32 localities 
studied, 12 contained individuals positive for arenavi¬ 
rus antibodies. Most of the positive localities were lo¬ 
cated centrally within the state (Fig. 1). However, there 
was no clear pattern as to the presence or absence of 
the virus based on geographic location. Localities that 
contained antibody-positive individuals were relatively 
close to sites that did not contain antibody-positive 
individuals. There are several possible explanations 
for the distribution of the virus in the study area. First, 
there may be a link between habitat type and virus sus¬ 
ceptibility. However, Abbott et al. (2004) tested this 
possibility and found no such relationship within the 
localities studied. Second, there may be a genetic link 
to susceptibility. Based on our analyses, the levels of 
genetic differentiation and genetic diversity between 
localities did not appear to be affected by the presence 
or absence of arenavirus-positive individuals at the 
site. Although comparisons of antibody-positive and 
antibody-negative localities indicated that there was no 
difference in genetic variation and relatedness between 
the two, the markers used in this study are not directly 
tied to virus susceptibility. Additionally, we were 
not assessing genetic differences between antibody¬ 
positive and antibody-negative individuals, but rather 
differences between the gene pools of sites contain¬ 
ing positive individuals to sites that did not. Further 
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analyses and the utilization of markers directly tied to 
immunity and virus uptake in the host are warranted. 

Additional explanations exist for the distribu¬ 
tion of viruses seen in this study, beyond those tested. 
The presence of antibody-positive individuals in some 
populations and not in others could be the result of a 
founder effect. Virus-positive individuals could have 
moved into certain localities and not others, thus bring¬ 
ing the virus to certain populations. This idea currently 
is beyond the scope of this study, but remains a possible 
explanation for the distribution of the virus. It also is 
possible that some virus-positive localities that were 
distantly separated from other virus-positive localities 
(e.g.. Sites 2 and 4) represent viral refugia. Again, this 
explanation is beyond the scope of this study, but war¬ 
rants further investigation. Finally, it may be that more 
populations contained antibody-positive specimens and 
these individuals simply were not collected or the virus 
was not detected. Abbott et al. (2004) determined that 


sample size could have been a limiting factor at four 
sites (sites 3,7,24, and 25) for which no virus antibody 
was detected. However, 15 sites (sites 5, 6, 8, 9, 11, 
14, 16, 17, 22, 23, and 28-32) were determined to be 
statistically antibody-negative. 

Conclusions.—Neotoma albigula is a widespread 
and readily abundant species which has the possibility 
of easily coming into contact with humans, especially in 
the central portion of Arizona. These samples and this 
species warrant further study due to the fact that there 
may be multiple arenavirus strains associated with N. 
albigula (Milazzo et al. 2015). Mark-recapture studies 
or other sampling methods that would allow for the de¬ 
velopment of a pedigree within these populations may 
help address the question pertaining to the route of viral 
transmission. This study provides the basic population 
genetic groundwork for this species and should serve 
as a stepping-stone for further investigations. 
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